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We study how the dynamic equilibrium of the reversible protein- 
protein binding network in yeast S. cerevisiae responds to large 
changes in abundances of individual proteins. The magnitude of 
shifts between free and bound concentrations of their immediate 
and more distant neighbors in the network is influenced by such 
factors as the network topology, the distribution of protein concen- 
trations among its nodes, and the average binding strength. Our 
primary conclusion is that on average the effects of a perturbation 
are strongly localized and exponentially decay with the network dis- 
tance away from the perturbed node. This explains why, despite 
globally connected topology, individual functional modules in such 
networks are able to operate fairly independently. We also found that 
under specific favorable conditions, realized in a significant number 
of paths in the yeast network, concentration perturbations can se- 
lectively propagate over considerable network distances (up to four 
steps). Such "action-at-a-distance" requires high concentrations of 
heterodimers along the path as well as low free (unbound) concen- 
tration of intermediate proteins. 

law of mass action | genetic interactions | dissociation constant | small-world 
networks | binding equilibrium 

Introduction 

Recent high-throughput experiments performed in a wide variety 
of organisms revealed networks of protein-protein physical interac- 
tions (PPI) that are interconnected on a genome-wide scale. In such 
"small-world" PPI networks most pairs of nodes can be linked to each 
other by relatively short chains of interactions involving just a few in- 
termediate proteins 1 1 1. While globally connected architecture facil- 
itates biological signaling and possibly ensures a robust functioning 
of the cell following a random failure of its components [2], it also 
presents a potential problem by providing a conduit for propagation 
of undesirable cross-talk between individual functional modules and 
pathways. Indeed, large (several-fold) changes in proteins' levels in 
the course of activation or repression of a certain functional module 
affect bound concentrations of their immediate interaction partners. 
These changes have a potential to cascade down a small-world PPI 
network affecting the equilibrium between bound and unbound con- 
centrations of progressively more distant neighbors including those in 
other functional modules. Most often such indiscriminate propaga- 
tion would represent an undesirable effect which has to be either tol- 
erated or corrected by the cell. On the other hand, a controlled trans- 
duction of reversible concentration changes along specific conduits 
may be used for biologically meaningful signaling and regulation. A 
routine and well known example of such regulation is inactivation of 
a protein by sequestration with its strong binding partner. 

In this study we quantitatively investigate how large concentra- 
tion changes propagate in the PPI network of yeast S. cerevisiae. We 



focus on the non-catalytic or reversible binding interactions whose 
equilibrium is governed by the Law of Mass Action (LMA) and do 
not consider irreversible, catalytic processes such as protein phos- 
phorylation and dephosphorylation, proteolytic cleavage, etc. While 
such catalytic interactions constitute the most common and best stud- 
ied mechanism of intracellular signaling, they represent only a rather 
small minority of all protein-protein physical interactions (for exam- 
ple, only ~5% links in the yeast network used in our study involve a 
kinase). 

Furthermore, the balance between free and bound concentrations 
of proteins matters even for irreversible (catalytic) interactions. For 
example, the rate of a phosphorylation reaction depends on the avail- 
ability of free kinases and substrate proteins which are both con- 
trolled by the LMA equilibrium calculated here. Thus perturba- 
tions of equilibrium concentrations considered in this study could 
be spread even further by other mechanisms such as transcriptional 
and translational regulation, and irreversible posttranslational protein 
modifications. 

Results 

To illustrate general principles on a concrete example, in this study 
we used a highly curated genome-wide network of protein-protein 
physical interactions in yeast (S. cerevisiae), which, according to the 
BIOGRID database |3|, were independently confirmed in at least two 
publications. We combined this network with a genome-wide dataset 
of protein abundances in the log-phase growth in rich medium, mea- 
sured by the TAP-tagged western blot technique [4]. Average pro- 
tein concentrations in this dataset range between 50 and 1,000,000 
molecules/cell with the median value around 3000 molecules/cell. 
After keeping only the interactions between proteins with known 
concentrations we were left with 4185 binding interactions among 
1740 proteins (Table SI). The BIOGRID database (3) lists all in- 
teractions as pairwise and thus lacks information about multi-protein 
complexes larger than dimers. Thus in the main part of this study 
we consider only homo- and hetero-dimers and ignore the formation 
of higher-order complexes. In the Supplementary materials we show 
that the reliable data on multi-protein complexes can be easily incor- 
porated into our analysis. Furthermore, we demonstrate that taking 
into account such complexes leaves our results virtually unchanged 
(see supplementary Table S4 and Fig. S3). 
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The state of the art genome-wide PPI datasets lack information 
on dissociation constants Kij of individual interactions. The only 
implicit assumption is that the binding is sufficiently strong to be 
detectable by a particular experimental technique (some tentative 
bounds on dissociation constants detectable by different techniques 
were reported recently |5|). A rough estimate of the average binding 
strength in functional protein-protein interactions could be obtained 
from the PINT database | 6|. This database contains about 400 exper- 
imentally measured dissociation constants between wildtype proteins 
from a variety of organisms. In agreement with predictions of Refs. 
(7] [8) the histogram of these dissociation constants has an approxi- 
mately log-normal shape. The average relevant for our calculations is 
that of the association constant {1/Kij) =l/(5nM). Common sense 
dictates that the dissociation constant of a functional binding between 
a pair of proteins should increase with their abundances. The ma- 
jority of specific physical interactions between proteins are neither 
too weak (to ensure a considerable number of bound complexes) nor 
unnecessarily strong. Indeed, there is little evolutionary sense in in- 
creasing the binding strength between a pair of proteins beyond the 
point when both proteins (or at least the rate limiting one) spend 
most of their time in the bound state. The balance between these 
two opposing requirements is achieved by the value of dissociation 
constant Kij equal to a fixed fraction of the largest of the two abun- 
dances d and Cj of interacting proteins. In our simulations we used 
Kij = max(Ci, Cj)/ '20 in which case the average association con- 
stant nicely agrees with its empirical value (l/(5nM)) observed in the 
PINT database 1 6 1 . In addition to this, perhaps, more realistic assign- 
ment of dissociation constants we also simulated binding networks 
in which dissociation constants of all 4185 edges in our network are 
equal to each other and given by InM, lOnM, lOOnM, and 1/jM. 
Numerical calculation of bound and free (unbound) equilib- 
rium concentrations. The Law of Mass Action (LMA) relates the 
free (unbound) concentration Fi of a protein to its total (bound and 
unbound) concentration d as 

Fl = l + Ejh/Kij ■ U] 

Here the sum is over all specific binding partners of the protein i with 
free concentrations Fj and dissociation constants Kij, While in the 
general case these nonlinear equations do not allow for an analytical 
solution for Fi, they are readily solved numerically e.g. by successive 
iterations. 

Concentration-coupled proteins. To investigate how large changes 
in abundances of individual protein affect the equilibrium through- 
out the PPI network we performed a systematic numerical study in 
which we recalculated the equilibrium free concentrations of all pro- 
tein nodes following a twofold increase in the total concentration of 
just one of them: C\ — > 2CV This was repeated for the source of 
twofold perturbation spanning the set of all 1740 of proteins in our 
network [9]. The magnitude of the initial perturbation was selected 
to be representative of a typical shift in gene expression levels or 
protein abundances following a change in external or internal condi- 
tions. Thus here we simulate the propagation of functionally relevant 
changes in protein concentrations and not that of background stochas- 
tic fluctuations. A change in the free concentration Fj of another 
protein was deemed to be significant if it exceeded the 20% level, 
which according to Ref. 1 10 1 is the average magnitude of cell-to-cell 
variability of protein abundances in yeast. We refer to such protein 
pairs i — » j as concentration-coupled. The detection threshold could 
be raised simultaneously with the magnitude of the initial perturba- 
tion. For example, we found that the list of concentration-coupled 
pairs changes very little if instead of twofold (+100%) perturbation 



and the 20% detection threshold one applies a sixfold (+500%) initial 
perturbation and twofold (100%) detection threshold. 

In general we found that lists of concentration-coupled pro- 
teins calculated for different assignments of dissociation constants 
strongly overlap with each other. For example, more than 80% 
of concentration-coupled pairs observed for the variable Kij = 
max(Ci , (7,-)/20 assignment described above were also detected for 
the uniform assignment Kij — const = lOnM (for more details see 
the supplementary table S3) This relative robustness of our results al- 
lowed us to use the latter conceptually simplest case to illustrate our 
findings in the rest of the manuscript. 

The complete list of concentration-coupled pairs is included in 
the supplementary materials. Given the incompleteness and uncer- 
tainty in our knowledge of the network topology, protein abundances, 
and values of dissociation constants, these lists provide only a rough 
estimate of the actual magnitude of perturbations that could be mea- 
sured experimentally. 
Central observations. We found that: 

• On average, the magnitude of cascading changes in equilib- 
rium free concentrations exponentially decays with the dis- 
tance from the source of a perturbation. This explains why, 
despite a globally connected topology, individual modules in 
such networks are able to function fairly independently. 

• Nevertheless, specific favorable conditions identified in our 
study cause perturbations to selectively affect proteins at con- 
siderable network distances (sometimes as far as four steps 
away from the source). This indicates that in general, such 
cascading changes could not be neglected when evaluating the 
consequences of systematic changes in protein levels, e.g. in 
response to environmental factors, or in gene knockout exper- 
iments. Conditions favorable for propagation of perturbations 
combine high yet monotonically decreasing concentrations of 
all heterodimers along the path with low free (unbound) con- 
centrations of intermediate proteins. While reversible protein 
binding links are symmetric, the propagation of concentration 
changes is usually asymmetric with the preferential direction 
pointing down the gradient in the total concentrations of pro- 
teins. 

Examples of multi-step cascading changes. In Fig. Q}\B we il- 
lustrate these observations using two examples. In each of these cases 
the twofold increase in the abundance of just one protein (marked 
with the yellow circle in the center of each panel) has significantly (> 
20%) affected equilibrium free concentrations of a whole cluster of 
proteins some as far as 4 steps away from the source of the perturba- 
tion. However, the propagation beyond immediate neighbors is rather 
specific. For example, in the case of SUP35 (Fig. [TJ\) only 1 out of 
169 of its third nearest neighbors were affected above the 20% level. 
Note that changes in free concentrations generally sign-altemate with 
the network distance from the source. Indeed, free concentrations of 
immediate binding partners of the perturbed protein usually drop as 
more of them become bound in heterodimers with it. This, in turn, 
lowers concentrations of the next-nearest heterodimers and thus in- 
creases free concentrations of proteins at distance 2 from the source 
of perturbation, and so on. 

Exponential decay with the network distance. The results of our 
quantitative network-wide analysis of these effects are summarized in 
Fig. [2] and Table [T] From Fig [2] one concludes that the fraction of 
proteins with significantly affected free concentrations rapidly (ex- 
ponentially) decays with the length L of the shortest path (network 
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distance) from the perturbed protein. The same statement holds true 
for bound concentrations if the distance is measured as the shortest 
path from the perturbed protein to any of the two proteins forming 
a heterodimer. Thus, on average, the propagation of concentration 
changes along the PPI network is indeed considerably dampened. On 
the other hand, from Table Q] one concludes that the total number of 
multi-step chains along which concentration changes propagate with 
little attenuation remains significant for all but the largest values of 
the dissociation constant. These two observations do not contradict 
each other since the number of proteins separated by distance L (the 
last column in TableQJ rapidly grows with L. 

Conditions favoring the multi-step propagation of perturba- 
tions. What conditions favor the multi-step propagation of perturba- 
tions along particular channels? In Fig. [3]\ we show a group of 
highly abundant proteins along with all binding interactions between 
them. Then on panel B of the same figure we show only those interac- 
tions that according to our LMA calculation give rise to highly abun- 
dant heterodimers (equilibrium concentration >1000 per cell). This 
breaks the densely interconnected subnetwork drawn in the panel A 
into 10 mutually isolated clusters. Some of these clusters contain 
pronounced linear chains which serve as conduits for propagation of 
concentration perturbations. The fact that perturbations indeed tend 
to propagate via highly abundant heterodimers is illustrated in the 
next panel (Fig. [3f ) where red arrows correspond to concentration- 
coupled nearest neighbors A— +B. Evidently, the edges in panels B 
and C largely (but not completely) coincide. Additionally, the panel 
C defines the preferred direction of propagation of perturbations from 
a more abundant protein to its less abundant binding partners. 

To further investigate what causes concentration changes to prop- 
agate along particular channels we took a closer look at eight three- 
step chains A — > A\ — > A2 — > B with the largest magnitude of per- 
turbation of the last protein B (twofold detection threshold follow- 
ing a twofold initial perturbation). The identification of intermediate 
proteins A\ and A2 was made by a simple optimization algorithm 
searching for the largest overall magnitude of intermediate perturba- 
tions along all possible paths connecting A and B. 

Inspection of the parameters of these chains shown in Fig. [4] 
allows one to conjecture that for a successful transduction of concen- 
tration changes, the following conditions should be satisfied: 

• Heterodimers along the whole path have to be of sufficiently 
high concentration Dij. 

• Intermediate proteins have to be highly sequestered. That is to 
say, in order to reduce buffering effects free-to-total concentra- 
tion ratios Fi /d should be sufficiently low for all but the last 
protein in the chain. 

• Total concentrations C\ should gradually decrease in the direc- 
tion of propagation. Thus propagation of perturbations along 
virtually all of these long conduits is unidirectional and fol- 
lows the gradient of concentration changes (a related concept 
of a "gradient network" was proposed for technological net- 
works in Ref. II 11 ). 

• Free concentrations Fi should alternate between relatively 
high and relatively low values in such a way that free con- 
centrations of proteins at steps 2 and 4 have enough "room" 
to go down. The two apparent exceptions to this rule visible 
in Fig. [4] may be optimized to respond to a drop (instead of 
increase) in the level of the first protein. 
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These findings are in agreement with our more detailed numerical 
and analytical analysis of propagation of fluctuations presented in 
1121 and illustrated for simple networks in the Supplementary mate- 
rials. In 1121 we demonstrated that the linear response of the LMA 
equilibrium to small changes in protein abundances could be approx- 
imately mapped to a current flow in the resistor network in which het- 
erodimer concentrations play the role of conductivities (which need 
to be large for a good transmission) while high Fi / d ratios result in 
the net loss of the perturbation "current" on such nodes and thus need 
to be minimized. 

Discussion 

Robustness with respect to assignment of dissociation con- 
stants. It has been often conjectured that the qualitative dynamical 
properties of biological networks are to a large extent determined by 
their topology rather than by quantitative parameters of individual 
interactions such as their kinetic or equilibrium constants (for a clas- 
sic success story see e.g. 1131 ). Our results generally support this 
conjecture, yet go one step further: we observe that the response of 
reversible protein-protein binding networks to large changes in con- 
centrations strongly depends not only on topology but also on abun- 
dances of participating proteins. Indeed, perturbations tend to pref- 
erentially propagate via paths in the network in which abundances of 
intermediate proteins monotonically decrease along the path (see Fig. 
[3). Thus by varying protein abundances while strictly preserving the 
topology of the underlying network, one can select different conduits 
for propagation of perturbations. 

On the other hand our results indicate that these conduits are 
to a certain degree insensitive to the choice of dissociation con- 
stants. In particular, we found (see Fig(5) that equilibrium concen- 
trations of dimers and the remaining free (unbound) concentrations 
of individual proteins calculated for two different Kij assignments 
(Kij — const = 5nM and Kij = max(Ci, Cj)/20 with the inverse 
mean of 5nM) had a high Spearman rank correlation coefficient of 
0.89 and even higher linear Pearson correlation coefficient of 0.98. 
The agreement was especially impressive in the upper part of the 
range of dimer concentrations (see Fig. [5). For example, the typical 
difference between dimer concentrations above 1000 molecules/cell 
was measured to be as low as 40%. As we demonstrated above it is 
exactly these highly abundant heterodimers that form the backbone 
for propagation of concentration perturbations. Thus it should come 
as no surprise that sets of concentration-coupled protein pairs ob- 
served for different Kij assignments also have a large (~ 70-80%) 
overlap with each other (see the supplementary table S3). 

Such degree of robustness with respect quantitative parameters 
of interactions can be partially explained by the following observa- 
tion: proteins whose abundance is higher than the sum of abundances 
of all of their binding partners cannot be fully sequestered into het- 
erodimers for any assignment of dissociation constants. As we ar- 
gued above, such proteins with substantial unbound concentrations 
considerably dampen the propagation of perturbations, and thus can- 
not participate in highly conductive chains. Another argument in fa- 
vor of this apparent robustness is based on extreme heterogeneity of 
wildtype protein abundances (in the dataset of Ref. |4| they span 5 
orders of magnitude). In this case concentrations of heterodimers de- 
pend more on relative abundances of two constituent proteins than on 
the corresponding dissociation constant (within a certain range). 

In a separate numerical control experiment we verified that the 
main results of this study are not particularly sensitive to false 
positives and false negatives in the network topology inevitably 
present even in the best curated large-scale data. The percentage of 
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concentration-coupled pairs surviving a random removal or addition 
of 20% of links in the network generally ranges between 60% and 
80% (see supplementary table S2). 

Genetic interactions. The effects of concentration perturbations 
discussed above could explain some of the genetic interactions be- 
tween proteins. Consider for example a "dosage rescue" of a protein 
A by a protein B, or the correction of an abnormal phenotype caused 
by deletion or other type of inactivation of A by overexpression of 
B. One possible mechanism behind this effect is that the knockout 
of A and overexpression of B affect the LMA equilibrium in oppo- 
site directions and to some extent cancel one another. In order for 
this mechanism to be applicable (albeit tentatively), concentrations 
of both A and B must be simultaneously coupled (in the sense used 
throughout this work) to at least one crucial protein C whose free or 
bound concentration has to be maintained at or close to wildtype lev- 
els. To assess this hypothesis, we analyzed the set of 772 dosage res- 
cue pairs involving proteins from the PPI network used in this study 
of 2531 dosage rescue pairs listed in the BIOGRID database |3J. For 
136 pairs (or 18% of all dosage rescue pairs), we were able to iden- 
tify one or more putative "rescued" protein whose free concentration 
was considerably (by >20%) affected by changes in abundances of 
both A and B (see supplementary Table S5). This overlap is highly 
statistically significant, having the Fisher's exact test p- value around 
10" 216 . Even more convincing evidence that perturbations to the 
LMA equilibrium state cause some of genetic interactions is pre- 
sented in Figure [6] It plots the fraction of protein pairs at distance L 
from each other in the PPI network that are known to dosage rescue 
each other. From this figure one concludes that proteins separated 
by distances 1,2, and 3 are significantly more likely to genetically 
interact with each other than one expects by pure chance alone (the 
expected background level is marked with a dashed line or better yet 
visible as a plateau for large values of L). Furthermore, the slope 
of the exponential decay in the fraction of dosage rescue pairs as a 
function of L is roughly consistent with that shown in Fig. [2] for the 
fraction of concentration-coupled pairs. 

Possibility of functional signaling and regulation mediated 
by multi-step reversible protein interactions. Another intrigu- 
ing possibility raised by our findings is that multi-step chains of re- 
versible protein-protein bindings might in principle be involved in 
meaningful intracellular signaling and regulation. There are many 
well-documented cases in which one-step "chains" are used to re- 
versibly deactivate individual proteins by the virtue of sequestration 
with their binding partner(s). An example involving a longer regula- 
tory chain of this type is the control of activity of condition-specific 
sigma factors in bacteria. In its biologically active state, a given 
sigma factor is bound to the RNA polymerase complex. Under nor- 
mal conditions it is commonly kept in an inactive form by the virtue 
of a strong binding with its specific anti-sigma factor (anti-sigma fac- 
tors are reviewed in 1141 ). In several known cases the concentration 
of the anti-sigma factor in turn is controlled by its binding with the 
specific anti-anti sigma factor 1141 . The existence of such experi- 
mentally confirmed three-step regulatory chains in bacteria hints at 
the possibility that at least some of the longer conduits we detected 
in yeast could be used in a similar way. 

Application to microarray data analysis. In order to unequivo- 
cally detect cascading perturbations, in our simulations we always 
modified the total concentration of just one protein at at time. In 
more realistic situations, expression levels of a whole cluster of genes 
change, for example, in response to a shift in environmental condi- 
tions. Our general methods could be easily extended to incorporate 



this scenario. With the caveat that changes in expression levels of 
genes reflect changes in overall abundances of corresponding pro- 
teins, our algorithm allows one to calculate the impact of an exter- 
nal or internal stimulus measured in a microarray on free and bound 
concentrations of all proteins in the cell. Including such indirectly 
perturbed targets could considerably extend the list of proteins af- 
fected by a given shift in environmental conditions. Simultaneous 
shifts in expression levels of several genes may amplify changes of 
free concentrations of some proteins and/or mutually inhibit changes 
of others. 

Effects of intracellular noise. Another implication of our findings 
is for intracellular noise, or small random changes in total concen- 
trations d of a large number of proteins. The randomness, smaller 
magnitude, and the sheer number of the involved proteins character- 
ize the differences between such noise and systematic several-fold 
changes in the total concentration of one or several proteins consid- 
ered above. Our methods allow one to decompose the experimen- 
tally measured 1 10 1 noise in total abundances of proteins into biolog- 
ically meaningful components (free concentrations and bound con- 
centrations within individual protein complexes). Given a fairly small 
magnitude of fluctuations in protein abundances (on average around 
20% (lO'l), one could safely employ a computationally-efficient lin- 
ear response algorithm (see 1121 "). Several recent studies 1151 , 1161 . 
1 10] distinguish between the so-called extrinsic and intrinsic noise. 
The extrinsic noise corresponds to synchronous or correlated shifts in 
abundance of multiple proteins which, among other things, could be 
attributed to variation in cell sizes and their overall mRNA and pro- 
tein production or degradation rates. Conversely, the intrinsic noise is 
due to stochastic fluctuations in production and degradation and thus 
lacks correlation between different proteins. We found that extrin- 
sic and intrinsic noise affect equilibrium concentrations of proteins 
in profoundly different ways. In particular, while multiple sources of 
the extrinsic noise partially (yet not completely) cancel each other, 
intrinsic noise contributions from several sources can sometimes add 
up and cause considerable fluctuations in equilibrium free and bound 
concentrations of particular proteins (see Figure|7). 

Limitations of the current approach and directions for fur- 
ther studies. In our study we used a number of fundamental ap- 
proximations and idealizations including the assumption of spatially 
uniform concentrations of proteins, the neglect of temporal dynamics 
or, equivalently, the assumption that all concentrations have sufficient 
time to reach their equilibrium values, the continuum approximation 
neglecting the discrete nature of proteins and their bound complexes, 
etc. Another set of approximations was mostly due to the lack of 
reliable large-scale data quantifying these effects. They include not 
taking into account the effects of cooperative binding within multi- 
protein complexes, using a relatively small number (81) of well cu- 
rated multi-protein complexes used in our study (see supplementary 
materials), neglecting systematic changes in protein abundances in 
the course of the cell cycle, etc. We do not expect these effects to sig- 
nificantly alter our main qualitative conclusions, namely, the expo- 
nential decay of the amplitude of changes in equilibrium concentra- 
tions, the existence of 3-4 step chains that nevertheless successfully 
propagate concentration changes, and the general conditions that en- 
hance or inhibit such propagation. 

In the future we plan to extend our study of fluctuations in equi- 
librium concentrations by incorporating the effects of protein dif- 
fusion (non-uniform spatial concentration) and kinetic effects. An- 
other interesting avenue for further research is to apply the concept of 
"potential energy landscape" (for definitions see 1171 and references 
therein) to reversible processes governed by the law of mass action, 
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such as e.g. the equilibrium in protein binding networks. In the past 
this concept was applied to processes involving catalytic, irreversible 
protein-protein interactions such as e.g. phosphorylation by kinases 
or regulation by transcription factors. In this case it helped to reveal 
the robustness of regulatory networks in the cell cycle 1 18 1 and in a 
simple two-protein toggle switch 1 19 1. 

Methods 

Source of interaction and concentration data The curated PPI 
network data used in our study is based on the 2.020 release of the 
BIOGRID database |3|. We kept only pairs of physically interacting 
proteins that were reported in at least two publications using the fol- 
lowing experimental techniques: Affinity Capture-MS (28172 pairs), 
Affinity Capture-RNA (55 pairs), Affinity Capture-Western (5710 
pairs), Co-crystal Structure (107 pairs), FRET (43 pairs), Far West- 
ern (41 pair), Two-hybrid (1 1935 pairs). That left us with 5798 non- 
redundant interacting pairs. Further restriction for both proteins to 
have experimentally measured total abundance |4| narrowed it down 
to 4185 distinct interactions among 1740 yeast proteins. 

The list of manually curated yeast protein complexes was ob- 
tained from the latest release (May 2006) of the MIPS CYGD 
database [20, 21 1. The database contains 1205 putative protein com- 
plexes 326 of which are not coming from systemic analysis studies 
(high-throughput MS experiments). In the spirit of using only the 
confirmed PPI data we limited our study to these manually curated 
326 complexes. For 99 of these complexes the MIPS database lists 3 
or more constituent proteins. After elimination of proteins with un- 
known total concentrations we were left with 8 1 multi-protein com- 
plexes. 
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Genetic interactions of dosage rescue type were also obtained 
from the BIOGRID database. There are 772 pairs of dosage rescue 
interactions among 1740 proteins participating in our PPI network 
(the full list contains 2531 dosage rescue pairs). 

Numerical algorithms. The numerical algorithm calculating 
all free concentrations Fi given the set of total concentrations 
d and the matrix of dissociation constants Kij was imple- 
mented in MATLAB 7.1 and is available for downloading on 
|http : / /www . cmth . bnl . gov/~maslov/programs . htm| It 
consists of iterating the Eq. Q] starting with Fi = CV Iterations stop 
once relative change of free concentration on every node in the course 
of one iteration step becomes smaller than 10 -8 which for networks 
used in our study takes less than a minute on a desktop computer. 
When necessary, multiprotein complexes are incorporated into this 
algorithm as described in the Supplementary Materials. 

The effects of large concentration perturbations was calculated 
by recalculating free concentrations following a twofold increase in 
abundance of a given perturbed protein. The effects of small pertur- 
bations such as those of concentration fluctuations were calculated 
using the faster linear response matrix formalism described elsewhere 
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Fig. 1. Two cases of propagation of large concentration changes in the yeast protein binding network. The total (bound + unbound) concentration of the protein marked 
with the yellow circle (the SUP35 protein (A), the SEC27 protein (B)) was increased twofold from its wildtype value in the rich growth medium 4 . Red and green circles mark 
all other proteins whose equilibrium free (unbound) concentrations have increased (green) or decreased (red) by more than 20%. The area of each circle is proportional to the 
logarithm of the change in free concentration. Edges show all physical interactions among this group of proteins with the shade of gray proportional to the logarithm of the 
equilibrium concentration of the corresponding dimer calculated for Kij = const = lOnM. 

Table 1. The number of concentration-coupled pairs of yeast proteins separated by network distance L. Numerical 
simulations (twofold initial perturbation, 20% detection threshold) were performed for different assignment of dissociation 
constants: Kij = max(Ci, Cj)/20 (column 2), Kij — const =lnM, lOnM, O.l/xM.l^iM (columns 3-6). The column 7 lists the 
total number of protein pairs at distance L. 
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Fig. 3. A) All binding links between a subset of 312 highly abundant proteins. B) Binding links characterized by high concentration of heterodimers (> 1000 molecules/cell). 
The level of gray of binding links scales with the logarithm of concentration of the corresponding heterodimer. C) Concentration-coupled proteins A — * B with the property that 
a twofold increase in the abundance A reduces free concentration of its immediate binding partner B by 20% or more. Note that links roughly coincide with highly abundant 
dimers shown in the panel B. Arrows reveal the preferential direction of propagation of perturbations. 




Fig. 4. Parameters of the eight three-step chains that exhibit the best transduction of concentration changes: Heterodimer concentrations D^j (A) for three binding links 
along the chain. Total concentrations Oj (B) and free-to-total concentration ratios F\/Ci (C) of the four proteins involved in these chains. Dashed lines correspond to 
network-wide geometric averages of the corresponding quantities: {Dij} ~ 100 copies/cell, (Ci) ~ 3000 copies/cell, and {Fi/Ci) = 13%. 
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Fig. 5. The scatter plot of 4185 bound concentrations Dij (panel A) and 1740 free concentrations Fi (panel B) calculated for two different assignments of dissociation 
constants to links in the PPI network. The x-axis was computed for the homogeneous assignment Kij = const = 5nM , while the y-axis was computed for the heterogeneous 
assignment K%j = max(Cj , Cj ) /20 with the same average strength. The dashed lines along the diagonals are drawn at X = y, while the horizontal and vertical solid lines 
denote the concentration of 1 molecule/cell. Note that equilibrium concentrations in the upper part of their range (e.g. above 1000 molecules/cell) are nearly independent of 
the choice of Kij . Also, our choice of heterogeneous assignment nearly eliminates free or bound concentrations in a biologically unreasonable range <1 molecules/cell 
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Fig. 6. The fraction of dosage rescue protein pairs separated by distance L in the PPI network. Note that pairs at distances 1,2 and 3 are significantly overrepresented 
over the background level marked with dashed line (772/1740^) or visible as a plateau at large distances L. The exponential decay constant at low values of L is consistent 
with that in Fig. [2] 
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Fig. 7. The magnitude of extrinsic (panel A) and intrinsic noise in free concentrations F$ of proteins when their total concentrations d fluctuate by 20%. In this plot we 
used Kij = const = InM. One can see that while the extrinsic noise is suppressed in the low concentrations region, the intrinsic one is uniformly high and reaches as much 
as >300% in the mid-i^ range. 



Footline Author 



PNAS | Issue Date | Volume | Issue Number | 11 



